Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 99976 |
| Missing cells | 227059 |
| Missing cells (%) | 9.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 19.1 MiB |
| Average record size in memory | 200.0 B |
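The derived rows in the table above follow from the raw counts. A minimal sketch in plain Python, using only the figures reported above; the variable names are illustrative and not part of the report, and the size check assumes the total is simply rows times average record size:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 25
n_observations = 99976
missing_cells = 227059
avg_record_bytes = 200.0

# Missing cells (%) is the share of all cells (rows x columns) that are empty.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Total size is approximately the average record size times the number of rows.
total_mib = avg_record_bytes * n_observations / 2**20

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Total size: {total_mib:.1f} MiB")
```

Both values round to the figures shown in the table (9.1% and 19.1 MiB).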
Variable types
| Numeric | 23 |
|---|---|
| Categorical | 2 |
account_amount_added_12_24m is highly correlated with num_unpaid_bills and 2 other fields | High correlation |
account_days_in_rem_12_24m is highly correlated with sum_capital_paid_account_12_24m | High correlation |
account_incoming_debt_vs_paid_0_24m is highly correlated with num_unpaid_bills | High correlation |
avg_payment_span_0_12m is highly correlated with avg_payment_span_0_3m and 1 other fields | High correlation |
avg_payment_span_0_3m is highly correlated with avg_payment_span_0_12m | High correlation |
max_paid_inv_0_12m is highly correlated with max_paid_inv_0_24m and 2 other fields | High correlation |
max_paid_inv_0_24m is highly correlated with max_paid_inv_0_12m and 3 other fields | High correlation |
num_active_div_by_paid_inv_0_12m is highly correlated with num_active_inv and 1 other fields | High correlation |
num_active_inv is highly correlated with num_active_div_by_paid_inv_0_12m and 1 other fields | High correlation |
num_arch_ok_0_12m is highly correlated with max_paid_inv_0_12m and 3 other fields | High correlation |
num_arch_ok_12_24m is highly correlated with max_paid_inv_0_24m and 2 other fields | High correlation |
num_arch_rem_0_12m is highly correlated with avg_payment_span_0_12m | High correlation |
num_unpaid_bills is highly correlated with account_amount_added_12_24m and 5 other fields | High correlation |
sum_capital_paid_account_0_12m is highly correlated with account_amount_added_12_24m and 2 other fields | High correlation |
sum_capital_paid_account_12_24m is highly correlated with account_amount_added_12_24m and 3 other fields | High correlation |
sum_paid_inv_0_12m is highly correlated with max_paid_inv_0_12m and 3 other fields | High correlation |
account_amount_added_12_24m is highly correlated with sum_capital_paid_account_0_12m and 1 other fields | High correlation |
avg_payment_span_0_12m is highly correlated with avg_payment_span_0_3m | High correlation |
avg_payment_span_0_3m is highly correlated with avg_payment_span_0_12m | High correlation |
max_paid_inv_0_12m is highly correlated with max_paid_inv_0_24m and 1 other fields | High correlation |
max_paid_inv_0_24m is highly correlated with max_paid_inv_0_12m | High correlation |
num_active_inv is highly correlated with num_arch_ok_0_12m and 2 other fields | High correlation |
num_arch_ok_0_12m is highly correlated with num_active_inv and 2 other fields | High correlation |
num_arch_ok_12_24m is highly correlated with num_active_inv and 2 other fields | High correlation |
sum_capital_paid_account_0_12m is highly correlated with account_amount_added_12_24m and 1 other fields | High correlation |
sum_capital_paid_account_12_24m is highly correlated with account_amount_added_12_24m and 1 other fields | High correlation |
sum_paid_inv_0_12m is highly correlated with max_paid_inv_0_12m and 3 other fields | High correlation |
account_amount_added_12_24m is highly correlated with sum_capital_paid_account_0_12m and 1 other fields | High correlation |
avg_payment_span_0_12m is highly correlated with avg_payment_span_0_3m | High correlation |
avg_payment_span_0_3m is highly correlated with avg_payment_span_0_12m | High correlation |
max_paid_inv_0_12m is highly correlated with max_paid_inv_0_24m and 1 other fields | High correlation |
max_paid_inv_0_24m is highly correlated with max_paid_inv_0_12m and 1 other fields | High correlation |
num_active_div_by_paid_inv_0_12m is highly correlated with num_active_inv and 1 other fields | High correlation |
num_active_inv is highly correlated with num_active_div_by_paid_inv_0_12m and 1 other fields | High correlation |
num_arch_ok_0_12m is highly correlated with num_arch_ok_12_24m and 1 other fields | High correlation |
num_arch_ok_12_24m is highly correlated with num_arch_ok_0_12m and 1 other fields | High correlation |
num_unpaid_bills is highly correlated with num_active_div_by_paid_inv_0_12m and 2 other fields | High correlation |
sum_capital_paid_account_0_12m is highly correlated with account_amount_added_12_24m and 2 other fields | High correlation |
sum_capital_paid_account_12_24m is highly correlated with account_amount_added_12_24m and 1 other fields | High correlation |
sum_paid_inv_0_12m is highly correlated with max_paid_inv_0_12m and 3 other fields | High correlation |
account_amount_added_12_24m is highly correlated with sum_capital_paid_account_0_12m and 1 other fields | High correlation |
account_days_in_dc_12_24m is highly correlated with account_days_in_term_12_24m | High correlation |
account_days_in_term_12_24m is highly correlated with account_days_in_dc_12_24m | High correlation |
avg_payment_span_0_12m is highly correlated with avg_payment_span_0_3m | High correlation |
avg_payment_span_0_3m is highly correlated with avg_payment_span_0_12m | High correlation |
max_paid_inv_0_12m is highly correlated with max_paid_inv_0_24m | High correlation |
max_paid_inv_0_24m is highly correlated with max_paid_inv_0_12m | High correlation |
num_active_inv is highly correlated with num_arch_ok_0_12m and 3 other fields | High correlation |
num_arch_dc_0_12m is highly correlated with num_arch_dc_12_24m | High correlation |
num_arch_dc_12_24m is highly correlated with num_arch_dc_0_12m | High correlation |
num_arch_ok_0_12m is highly correlated with num_active_inv and 3 other fields | High correlation |
num_arch_ok_12_24m is highly correlated with num_active_inv and 2 other fields | High correlation |
num_arch_rem_0_12m is highly correlated with num_arch_ok_0_12m and 1 other fields | High correlation |
num_arch_written_off_12_24m is highly correlated with recovery_debt | High correlation |
num_unpaid_bills is highly correlated with num_active_inv and 1 other fields | High correlation |
recovery_debt is highly correlated with num_arch_written_off_12_24m | High correlation |
sum_capital_paid_account_0_12m is highly correlated with account_amount_added_12_24m and 1 other fields | High correlation |
sum_capital_paid_account_12_24m is highly correlated with account_amount_added_12_24m and 1 other fields | High correlation |
sum_paid_inv_0_12m is highly correlated with num_active_inv and 4 other fields | High correlation |
account_days_in_dc_12_24m has 11836 (11.8%) missing values | Missing |
account_days_in_rem_12_24m has 11836 (11.8%) missing values | Missing |
account_days_in_term_12_24m has 11836 (11.8%) missing values | Missing |
account_incoming_debt_vs_paid_0_24m has 59315 (59.3%) missing values | Missing |
avg_payment_span_0_12m has 23836 (23.8%) missing values | Missing |
avg_payment_span_0_3m has 49305 (49.3%) missing values | Missing |
num_active_div_by_paid_inv_0_12m has 22939 (22.9%) missing values | Missing |
num_arch_written_off_0_12m has 18078 (18.1%) missing values | Missing |
num_arch_written_off_12_24m has 18078 (18.1%) missing values | Missing |
account_days_in_dc_12_24m is highly skewed (γ1 = 38.39324078) | Skewed |
account_incoming_debt_vs_paid_0_24m is highly skewed (γ1 = 100.6863358) | Skewed |
recovery_debt is highly skewed (γ1 = 133.689137) | Skewed |
account_amount_added_12_24m has 71362 (71.4%) zeros | Zeros |
account_days_in_dc_12_24m has 87879 (87.9%) zeros | Zeros |
account_days_in_rem_12_24m has 78522 (78.5%) zeros | Zeros |
account_days_in_term_12_24m has 86932 (87.0%) zeros | Zeros |
account_incoming_debt_vs_paid_0_24m has 13072 (13.1%) zeros | Zeros |
max_paid_inv_0_12m has 21692 (21.7%) zeros | Zeros |
max_paid_inv_0_24m has 17615 (17.6%) zeros | Zeros |
num_active_div_by_paid_inv_0_12m has 48706 (48.7%) zeros | Zeros |
num_active_inv has 69515 (69.5%) zeros | Zeros |
num_arch_dc_0_12m has 95724 (95.7%) zeros | Zeros |
num_arch_dc_12_24m has 95991 (96.0%) zeros | Zeros |
num_arch_ok_0_12m has 27406 (27.4%) zeros | Zeros |
num_arch_ok_12_24m has 37905 (37.9%) zeros | Zeros |
num_arch_rem_0_12m has 76709 (76.7%) zeros | Zeros |
num_unpaid_bills has 52000 (52.0%) zeros | Zeros |
recovery_debt has 99754 (99.8%) zeros | Zeros |
sum_capital_paid_account_0_12m has 66011 (66.0%) zeros | Zeros |
sum_capital_paid_account_12_24m has 74788 (74.8%) zeros | Zeros |
sum_paid_inv_0_12m has 21692 (21.7%) zeros | Zeros |
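The Missing, Skewed and Zeros alerts above can be approximated with a few per-column checks. The sketch below is an illustration only: the function names and all three thresholds are assumptions chosen to be consistent with this report (for example, a skewness of 12.5 is not flagged while 38.4 is), not the documented defaults of pandas-profiling. Skewness is computed with the bias-adjusted Fisher-Pearson estimator.

```python
import math


def adjusted_skewness(values):
    """Bias-adjusted Fisher-Pearson sample skewness of a list of numbers."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    if n < 3 or m2 == 0:
        return float("nan")
    g1 = m3 / m2 ** 1.5
    return g1 * math.sqrt(n * (n - 1)) / (n - 2)


def column_alerts(values, missing_share=0.05, zeros_share=0.05, skew_threshold=20.0):
    """Return alert labels for one numeric column; None marks a missing value.

    All thresholds are hypothetical and only meant for illustration.
    """
    n = len(values)
    present = [v for v in values if v is not None]
    alerts = []
    if (n - len(present)) / n > missing_share:
        alerts.append("Missing")
    if len(present) >= 3 and abs(adjusted_skewness(present)) > skew_threshold:
        alerts.append("Skewed")
    if sum(1 for v in present if v == 0) / n > zeros_share:
        alerts.append("Zeros")
    return alerts


# A zero-inflated column with one large outlier, similar in shape to recovery_debt.
print(column_alerts([0] * 999 + [1000]))
```

A column that is almost entirely zero with a single large value triggers both the skewness and the zeros check, which matches the pattern of alerts listed for recovery_debt.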
Reproduction
| Analysis started | 2022-09-22 14:43:48.293638 |
|---|---|
| Analysis finished | 2022-09-22 14:44:45.314897 |
| Duration | 57.02 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
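A report of this kind is typically produced by passing a DataFrame to `ProfileReport` from pandas-profiling. The sketch below is an assumption about how that could look: the function name and both file paths are hypothetical, and the settings stored in config.json are not reproduced here, so the output may differ in detail from the report above.

```python
def build_profile_report(csv_path, html_path, title="Dataset profile"):
    """Profile a CSV file and write the result as an HTML report.

    csv_path and html_path are hypothetical; substitute your own files.
    """
    # Imported lazily so this function can be defined without the libraries installed.
    import pandas as pd
    from pandas_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title=title)
    profile.to_file(html_path)
    return profile
```

Calling `build_profile_report("dataset.csv", "report.html")` would run the full analysis, which took about 57 seconds for the dataset profiled here.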
account_amount_added_12_24m
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 23721 |
|---|---|
| Distinct (%) | 23.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12255.14952 |
| Minimum | 0 |
|---|---|
| Maximum | 1128775 |
| Zeros | 71362 |
| Zeros (%) | 71.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 4937.25 |
| 95-th percentile | 72967 |
| Maximum | 1128775 |
| Range | 1128775 |
| Interquartile range (IQR) | 4937.25 |
Descriptive statistics
| Standard deviation | 35481.48374 |
|---|---|
| Coefficient of variation (CV) | 2.895230588 |
| Kurtosis | 91.75787351 |
| Mean | 12255.14952 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.767239749 |
| Sum | 1225220828 |
| Variance | 1258935688 |
| Monotonicity | Not monotonic |
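Several rows in the descriptive statistics above are derived from the mean and standard deviation. A short sketch that recomputes them from the reported figures for account_amount_added_12_24m (the variable names are illustrative):

```python
# Reported figures for account_amount_added_12_24m (no missing values).
n_observations = 99976
mean = 12255.14952
std = 35481.48374

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean
# Variance is the square of the standard deviation.
variance = std ** 2
# The sum is the mean times the number of non-missing observations.
total = mean * n_observations

print(f"CV = {cv:.6f}")
print(f"Variance = {variance:.0f}")
print(f"Sum = {total:.0f}")
```

The results agree with the table up to the rounding of the reported inputs. A CV of about 2.9 indicates that the spread is almost three times the mean, which is typical for a variable that is zero in 71.4% of rows.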
| Value | Count | Frequency (%) |
| 0 | 71362 | 71.4% |
| 50 | 34 | < 0.1% |
| 30 | 34 | < 0.1% |
| 90 | 25 | < 0.1% |
| 60 | 22 | < 0.1% |
| 100 | 19 | < 0.1% |
| 20 | 12 | < 0.1% |
| 80 | 10 | < 0.1% |
| 40 | 8 | < 0.1% |
| 7431 | 6 | < 0.1% |
| Other values (23711) | 28444 | 28.5% |
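The common-values table above lists the ten most frequent values and folds the remainder into a single "Other values" row. A minimal sketch of that layout using only the standard library; the function names are hypothetical, and missing values are not handled:

```python
from collections import Counter


def common_values(values, top=10):
    """Return (label, count, percent) rows for the most frequent values."""
    n = len(values)
    counts = Counter(values)
    top_items = counts.most_common(top)
    rows = [(value, count, 100 * count / n) for value, count in top_items]
    # Everything outside the top-N is folded into one summary row.
    n_other_distinct = len(counts) - len(top_items)
    other_count = n - sum(count for _, count in top_items)
    if n_other_distinct > 0:
        label = f"Other values ({n_other_distinct})"
        rows.append((label, other_count, 100 * other_count / n))
    return rows


def format_percent(percent):
    """Format a percentage the way the tables do, with a floor label for tiny shares."""
    return "< 0.1%" if 0 < percent < 0.1 else f"{percent:.1f}%"


for label, count, percent in common_values([0, 0, 0, 5, 5, 7, 9], top=2):
    print(label, count, format_percent(percent))
```

With 23721 distinct values and ten rows shown, the remainder covers 23711 distinct values, which is the number in the "Other values" label above.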
| Value | Count | Frequency (%) |
| 0 | 71362 | 71.4% |
| 1 | 1 | < 0.1% |
| 11 | 6 | < 0.1% |
| 13 | 2 | < 0.1% |
| 14 | 4 | < 0.1% |
| 15 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 6 | < 0.1% |
| 18 | 4 | < 0.1% |
| 20 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 1128775 | 1 | < 0.1% |
| 1128654 | 1 | < 0.1% |
| 963598 | 1 | < 0.1% |
| 963594 | 1 | < 0.1% |
| 963477 | 1 | < 0.1% |
| 913943 | 1 | < 0.1% |
| 913805 | 1 | < 0.1% |
| 817134 | 1 | < 0.1% |
| 817029 | 1 | < 0.1% |
| 751971 | 1 | < 0.1% |
account_days_in_dc_12_24m
Real number (ℝ≥0)
| Distinct | 127 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11836 |
| Missing (%) | 11.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2230428863 |
| Minimum | 0 |
|---|---|
| Maximum | 365 |
| Zeros | 87879 |
| Zeros (%) | 87.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.808116523 |
|---|---|
| Coefficient of variation (CV) | 26.04035761 |
| Kurtosis | 1776.48798 |
| Mean | 0.2230428863 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 38.39324078 |
| Sum | 19659 |
| Variance | 33.73421754 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 87879 | 87.9% |
| 9 | 11 | < 0.1% |
| 28 | 10 | < 0.1% |
| 42 | 9 | < 0.1% |
| 67 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 56 | 7 | < 0.1% |
| 35 | 7 | < 0.1% |
| 99 | 7 | < 0.1% |
| 43 | 6 | < 0.1% |
| Other values (117) | 187 | 0.2% |
| (Missing) | 11836 | 11.8% |
| Value | Count | Frequency (%) |
| 0 | 87879 | 87.9% |
| 1 | 1 | < 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| 7 | 8 | < 0.1% |
| 9 | 11 | < 0.1% |
| 10 | 3 | < 0.1% |
| 11 | 3 | < 0.1% |
| 12 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 365 | 1 | < 0.1% |
| 362 | 1 | < 0.1% |
| 350 | 2 | < 0.1% |
| 322 | 1 | < 0.1% |
| 318 | 1 | < 0.1% |
| 316 | 1 | < 0.1% |
| 291 | 1 | < 0.1% |
| 289 | 2 | < 0.1% |
| 276 | 1 | < 0.1% |
| 271 | 1 | < 0.1% |
account_days_in_rem_12_24m
Real number (ℝ≥0)
| Distinct | 282 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 11836 |
| Missing (%) | 11.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.044622192 |
| Minimum | 0 |
|---|---|
| Maximum | 365 |
| Zeros | 78522 |
| Zeros (%) | 78.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 31 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 22.86397119 |
|---|---|
| Coefficient of variation (CV) | 4.532345598 |
| Kurtosis | 76.90992083 |
| Mean | 5.044622192 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.545146569 |
| Sum | 444633 |
| Variance | 522.7611784 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 78522 | 78.5% |
| 1 | 529 | 0.5% |
| 2 | 315 | 0.3% |
| 21 | 258 | 0.3% |
| 15 | 236 | 0.2% |
| 16 | 221 | 0.2% |
| 3 | 214 | 0.2% |
| 14 | 212 | 0.2% |
| 22 | 190 | 0.2% |
| 13 | 184 | 0.2% |
| Other values (272) | 7259 | 7.3% |
| (Missing) | 11836 | 11.8% |
| Value | Count | Frequency (%) |
| 0 | 78522 | 78.5% |
| 1 | 529 | 0.5% |
| 2 | 315 | 0.3% |
| 3 | 214 | 0.2% |
| 4 | 172 | 0.2% |
| 5 | 87 | 0.1% |
| 6 | 129 | 0.1% |
| 7 | 178 | 0.2% |
| 8 | 153 | 0.2% |
| 9 | 137 | 0.1% |
| Value | Count | Frequency (%) |
| 365 | 50 | < 0.1% |
| 362 | 1 | < 0.1% |
| 358 | 1 | < 0.1% |
| 356 | 1 | < 0.1% |
| 354 | 2 | < 0.1% |
| 353 | 1 | < 0.1% |
| 351 | 1 | < 0.1% |
| 346 | 1 | < 0.1% |
| 343 | 1 | < 0.1% |
| 341 | 1 | < 0.1% |
account_days_in_term_12_24m
Real number (ℝ≥0)
| Distinct | 64 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11836 |
| Missing (%) | 11.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2868958475 |
| Minimum | 0 |
|---|---|
| Maximum | 97 |
| Zeros | 86932 |
| Zeros (%) | 87.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 97 |
| Range | 97 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.929910479 |
|---|---|
| Coefficient of variation (CV) | 10.21245342 |
| Kurtosis | 188.5714584 |
| Mean | 0.2868958475 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.49859581 |
| Sum | 25287 |
| Variance | 8.584375415 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 86932 | 87.0% |
| 34 | 274 | 0.3% |
| 7 | 56 | 0.1% |
| 1 | 52 | 0.1% |
| 2 | 49 | < 0.1% |
| 11 | 42 | < 0.1% |
| 22 | 39 | < 0.1% |
| 8 | 38 | < 0.1% |
| 15 | 38 | < 0.1% |
| 23 | 36 | < 0.1% |
| Other values (54) | 584 | 0.6% |
| (Missing) | 11836 | 11.8% |
| Value | Count | Frequency (%) |
| 0 | 86932 | 87.0% |
| 1 | 52 | 0.1% |
| 2 | 49 | < 0.1% |
| 3 | 30 | < 0.1% |
| 4 | 20 | < 0.1% |
| 5 | 21 | < 0.1% |
| 6 | 24 | < 0.1% |
| 7 | 56 | 0.1% |
| 8 | 38 | < 0.1% |
| 9 | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| 97 | 2 | < 0.1% |
| 91 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 68 | 1 | < 0.1% |
| 67 | 2 | < 0.1% |
| 65 | 3 | < 0.1% |
| 64 | 1 | < 0.1% |
| 63 | 5 | < 0.1% |
| 61 | 1 | < 0.1% |
| 60 | 2 | < 0.1% |
account_incoming_debt_vs_paid_0_24m
Real number (ℝ≥0)
| Distinct | 23674 |
|---|---|
| Distinct (%) | 58.2% |
| Missing | 59315 |
| Missing (%) | 59.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.331291764 |
| Minimum | 0 |
|---|---|
| Maximum | 3914 |
| Zeros | 13072 |
| Zeros (%) | 13.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.1520819113 |
| Q3 | 0.662952183 |
| 95-th percentile | 2.792796942 |
| Maximum | 3914 |
| Range | 3914 |
| Interquartile range (IQR) | 0.662952183 |
Descriptive statistics
| Standard deviation | 26.48229928 |
|---|---|
| Coefficient of variation (CV) | 19.8921829 |
| Kurtosis | 12826.8267 |
| Mean | 1.331291764 |
| Median Absolute Deviation (MAD) | 0.1520819113 |
| Skewness | 100.6863358 |
| Sum | 54131.65444 |
| Variance | 701.312175 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 13072 | 13.1% |
| 8.03442623 | 57 | 0.1% |
| 0.004300732717 | 18 | < 0.1% |
| 2.151462995 × 10-5 | 17 | < 0.1% |
| 0.003650190114 | 15 | < 0.1% |
| 1.101746268 × 10-5 | 14 | < 0.1% |
| 0.0001136040897 | 12 | < 0.1% |
| 0.01112487991 | 12 | < 0.1% |
| 61.17605634 | 11 | < 0.1% |
| 0.000968054211 | 11 | < 0.1% |
| Other values (23664) | 27422 | 27.4% |
| (Missing) | 59315 | 59.3% |
| Value | Count | Frequency (%) |
| 0 | 13072 | 13.1% |
| 3.788294926 × 10-6 | 3 | < 0.1% |
| 4.933934615 × 10-6 | 2 | < 0.1% |
| 5.956245421 × 10-6 | 1 | < 0.1% |
| 6.034529578 × 10-6 | 1 | < 0.1% |
| 6.170173382 × 10-6 | 1 | < 0.1% |
| 6.584318786 × 10-6 | 1 | < 0.1% |
| 6.708482877 × 10-6 | 1 | < 0.1% |
| 7.028048943 × 10-6 | 2 | < 0.1% |
| 7.125907662 × 10-6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3914 | 1 | < 0.1% |
| 1443.48 | 1 | < 0.1% |
| 1435.58 | 3 | < 0.1% |
| 1336.935484 | 1 | < 0.1% |
| 1176.888889 | 1 | < 0.1% |
| 329.4814815 | 1 | < 0.1% |
| 299.7321429 | 1 | < 0.1% |
| 294 | 1 | < 0.1% |
| 270.195122 | 1 | < 0.1% |
| 245.6086957 | 1 | < 0.1% |
age
Real number (ℝ≥0)
| Distinct | 79 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.01628391 |
| Minimum | 18 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 25 |
| median | 34 |
| Q3 | 45 |
| 95-th percentile | 60 |
| Maximum | 100 |
| Range | 82 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.00130583 |
|---|---|
| Coefficient of variation (CV) | 0.3609841001 |
| Kurtosis | -0.04225289525 |
| Mean | 36.01628391 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.6895777822 |
| Sum | 3600764 |
| Variance | 169.0339534 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 3704 | 3.7% |
| 22 | 3477 | 3.5% |
| 21 | 3470 | 3.5% |
| 20 | 3258 | 3.3% |
| 23 | 3256 | 3.3% |
| 24 | 3013 | 3.0% |
| 28 | 3004 | 3.0% |
| 26 | 2979 | 3.0% |
| 29 | 2967 | 3.0% |
| 25 | 2949 | 2.9% |
| Other values (69) | 67899 | 67.9% |
| Value | Count | Frequency (%) |
| 18 | 3704 | 3.7% |
| 19 | 2653 | 2.7% |
| 20 | 3258 | 3.3% |
| 21 | 3470 | 3.5% |
| 22 | 3477 | 3.5% |
| 23 | 3256 | 3.3% |
| 24 | 3013 | 3.0% |
| 25 | 2949 | 2.9% |
| 26 | 2979 | 3.0% |
| 27 | 2920 | 2.9% |
| Value | Count | Frequency (%) |
| 100 | 1 | < 0.1% |
| 95 | 1 | < 0.1% |
| 94 | 1 | < 0.1% |
| 93 | 2 | < 0.1% |
| 92 | 1 | < 0.1% |
| 91 | 2 | < 0.1% |
| 90 | 2 | < 0.1% |
| 89 | 4 | < 0.1% |
| 88 | 6 | < 0.1% |
| 87 | 5 | < 0.1% |
avg_payment_span_0_12m
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 7939 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 23836 |
| Missing (%) | 23.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.97147269 |
| Minimum | 0 |
|---|---|
| Maximum | 260 |
| Zeros | 500 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.135063559 |
| Q1 | 10.8 |
| median | 14.90909091 |
| Q3 | 21 |
| 95-th percentile | 41 |
| Maximum | 260 |
| Range | 260 |
| Interquartile range (IQR) | 10.2 |
Descriptive statistics
| Standard deviation | 12.75106572 |
|---|---|
| Coefficient of variation (CV) | 0.7095170184 |
| Kurtosis | 20.71225599 |
| Mean | 17.97147269 |
| Median Absolute Deviation (MAD) | 4.909090909 |
| Skewness | 3.203545818 |
| Sum | 1368347.931 |
| Variance | 162.589677 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 2144 | 2.1% |
| 13 | 1833 | 1.8% |
| 15 | 1281 | 1.3% |
| 12 | 1227 | 1.2% |
| 16 | 1166 | 1.2% |
| 11 | 1081 | 1.1% |
| 10 | 1037 | 1.0% |
| 9 | 996 | 1.0% |
| 17 | 962 | 1.0% |
| 7 | 902 | 0.9% |
| Other values (7929) | 63511 | 63.5% |
| (Missing) | 23836 | 23.8% |
| Value | Count | Frequency (%) |
| 0 | 500 | 0.5% |
| 0.1666666667 | 3 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.2222222222 | 1 | < 0.1% |
| 0.25 | 3 | < 0.1% |
| 0.2857142857 | 1 | < 0.1% |
| 0.3333333333 | 10 | < 0.1% |
| 0.375 | 1 | < 0.1% |
| 0.5 | 37 | < 0.1% |
| 0.5161290323 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 260 | 1 | < 0.1% |
| 224 | 1 | < 0.1% |
| 217 | 1 | < 0.1% |
| 204 | 1 | < 0.1% |
| 187 | 1 | < 0.1% |
| 184 | 1 | < 0.1% |
| 182 | 1 | < 0.1% |
| 174 | 1 | < 0.1% |
| 173 | 1 | < 0.1% |
| 169 | 2 | < 0.1% |
avg_payment_span_0_3m
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 2256 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 49305 |
| Missing (%) | 49.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.98978561 |
| Minimum | 0 |
|---|---|
| Maximum | 87 |
| Zeros | 802 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8.4 |
| median | 13 |
| Q3 | 18.28571429 |
| 95-th percentile | 36 |
| Maximum | 87 |
| Range | 87 |
| Interquartile range (IQR) | 9.885714286 |
Descriptive statistics
| Standard deviation | 10.29742038 |
|---|---|
| Coefficient of variation (CV) | 0.6869624853 |
| Kurtosis | 4.824439581 |
| Mean | 14.98978561 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 1.783164222 |
| Sum | 759547.4267 |
| Variance | 106.0368664 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 2764 | 2.8% |
| 13 | 2269 | 2.3% |
| 6 | 1348 | 1.3% |
| 12 | 1335 | 1.3% |
| 16 | 1331 | 1.3% |
| 7 | 1327 | 1.3% |
| 15 | 1295 | 1.3% |
| 10 | 1252 | 1.3% |
| 11 | 1211 | 1.2% |
| 9 | 1150 | 1.2% |
| Other values (2246) | 35389 | 35.4% |
| (Missing) | 49305 | 49.3% |
| Value | Count | Frequency (%) |
| 0 | 802 | 0.8% |
| 0.08333333333 | 1 | < 0.1% |
| 0.1666666667 | 3 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.25 | 4 | < 0.1% |
| 0.2857142857 | 1 | < 0.1% |
| 0.3333333333 | 10 | < 0.1% |
| 0.3636363636 | 1 | < 0.1% |
| 0.3888888889 | 1 | < 0.1% |
| 0.4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 87 | 1 | < 0.1% |
| 86 | 1 | < 0.1% |
| 84 | 5 | < 0.1% |
| 83 | 5 | < 0.1% |
| 82 | 1 | < 0.1% |
| 81 | 2 | < 0.1% |
| 80 | 4 | < 0.1% |
| 79.66666667 | 1 | < 0.1% |
| 79 | 1 | < 0.1% |
| 78 | 2 | < 0.1% |
max_paid_inv_0_12m
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 12497 |
|---|---|
| Distinct (%) | 12.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9203.654217 |
| Minimum | 0 |
|---|---|
| Maximum | 279000 |
| Zeros | 21692 |
| Zeros (%) | 21.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2000 |
| median | 6052 |
| Q3 | 11380 |
| 95-th percentile | 29272.5 |
| Maximum | 279000 |
| Range | 279000 |
| Interquartile range (IQR) | 9380 |
Descriptive statistics
| Standard deviation | 13512.16723 |
|---|---|
| Coefficient of variation (CV) | 1.468130691 |
| Kurtosis | 56.10386348 |
| Mean | 9203.654217 |
| Median Absolute Deviation (MAD) | 4772 |
| Skewness | 5.653271466 |
| Sum | 920144534 |
| Variance | 182578663.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 21692 | 21.7% |
| 5290 | 440 | 0.4% |
| 895 | 397 | 0.4% |
| 4290 | 364 | 0.4% |
| 5000 | 290 | 0.3% |
| 6790 | 278 | 0.3% |
| 4790 | 274 | 0.3% |
| 3290 | 261 | 0.3% |
| 2290 | 251 | 0.3% |
| 6290 | 237 | 0.2% |
| Other values (12487) | 75492 | 75.5% |
| Value | Count | Frequency (%) |
| 0 | 21692 | 21.7% |
| 90 | 1 | < 0.1% |
| 175 | 2 | < 0.1% |
| 210 | 1 | < 0.1% |
| 270 | 2 | < 0.1% |
| 290 | 2 | < 0.1% |
| 295 | 4 | < 0.1% |
| 300 | 3 | < 0.1% |
| 320 | 1 | < 0.1% |
| 340 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 279000 | 1 | < 0.1% |
| 270295 | 1 | < 0.1% |
| 264300 | 3 | < 0.1% |
| 260395 | 1 | < 0.1% |
| 251890 | 1 | < 0.1% |
| 245110 | 2 | < 0.1% |
| 240000 | 1 | < 0.1% |
| 235790 | 2 | < 0.1% |
| 233890 | 7 | < 0.1% |
| 230545 | 1 | < 0.1% |
max_paid_inv_0_24m
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 12932 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11215.12082 |
| Minimum | 0 |
|---|---|
| Maximum | 538500 |
| Zeros | 17615 |
| Zeros (%) | 17.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3350 |
| median | 7580 |
| Q3 | 13783 |
| 95-th percentile | 34295 |
| Maximum | 538500 |
| Range | 538500 |
| Interquartile range (IQR) | 10433 |
Descriptive statistics
| Standard deviation | 15256.41494 |
|---|---|
| Coefficient of variation (CV) | 1.360343342 |
| Kurtosis | 56.77507679 |
| Mean | 11215.12082 |
| Median Absolute Deviation (MAD) | 5005 |
| Skewness | 5.367850225 |
| Sum | 1121242919 |
| Variance | 232758196.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 17615 | 17.6% |
| 5290 | 433 | 0.4% |
| 4290 | 280 | 0.3% |
| 5000 | 249 | 0.2% |
| 6290 | 249 | 0.2% |
| 895 | 247 | 0.2% |
| 9290 | 245 | 0.2% |
| 6790 | 240 | 0.2% |
| 3290 | 235 | 0.2% |
| 4790 | 225 | 0.2% |
| Other values (12922) | 79958 | 80.0% |
| Value | Count | Frequency (%) |
| 0 | 17615 | 17.6% |
| 90 | 1 | < 0.1% |
| 175 | 1 | < 0.1% |
| 210 | 2 | < 0.1% |
| 270 | 1 | < 0.1% |
| 290 | 1 | < 0.1% |
| 295 | 3 | < 0.1% |
| 300 | 3 | < 0.1% |
| 320 | 1 | < 0.1% |
| 370 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 538500 | 1 | < 0.1% |
| 279000 | 1 | < 0.1% |
| 270295 | 1 | < 0.1% |
| 264300 | 3 | < 0.1% |
| 260395 | 1 | < 0.1% |
| 251890 | 1 | < 0.1% |
| 245110 | 2 | < 0.1% |
| 240000 | 1 | < 0.1% |
| 235790 | 2 | < 0.1% |
| 234995 | 1 | < 0.1% |
num_active_div_by_paid_inv_0_12m
Real number (ℝ≥0)
| Distinct | 861 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 22939 |
| Missing (%) | 22.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.114840286 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 48706 |
| Zeros (%) | 48.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.1 |
| 95-th percentile | 0.5 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0.1 |
Descriptive statistics
| Standard deviation | 0.293483024 |
|---|---|
| Coefficient of variation (CV) | 2.555575525 |
| Kurtosis | 82.01839065 |
| Mean | 0.114840286 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.366929886 |
| Sum | 8846.951109 |
| Variance | 0.08613228539 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 48706 | 48.7% |
| 1 | 2425 | 2.4% |
| 0.5 | 2273 | 2.3% |
| 0.3333333333 | 1918 | 1.9% |
| 0.25 | 1683 | 1.7% |
| 0.2 | 1490 | 1.5% |
| 0.1666666667 | 1270 | 1.3% |
| 0.1428571429 | 1098 | 1.1% |
| 0.125 | 905 | 0.9% |
| 0.1111111111 | 826 | 0.8% |
| Other values (851) | 14443 | 14.4% |
| (Missing) | 22939 | 22.9% |
| Value | Count | Frequency (%) |
| 0 | 48706 | 48.7% |
| 0.006666666667 | 1 | < 0.1% |
| 0.007142857143 | 1 | < 0.1% |
| 0.007299270073 | 3 | < 0.1% |
| 0.007462686567 | 1 | < 0.1% |
| 0.007518796992 | 1 | < 0.1% |
| 0.007936507937 | 4 | < 0.1% |
| 0.00826446281 | 1 | < 0.1% |
| 0.008333333333 | 1 | < 0.1% |
| 0.008474576271 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 2 | < 0.1% |
| 6 | 5 | < 0.1% |
| 5 | 10 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 4 | 14 | < 0.1% |
| 3.5 | 2 | < 0.1% |
| 3 | 75 | < 0.1% |
| 2.5 | 8 | < 0.1% |
num_active_inv
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5994038569 |
| Minimum | 0 |
|---|---|
| Maximum | 47 |
| Zeros | 69515 |
| Zeros (%) | 69.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 47 |
| Range | 47 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.550026416 |
|---|---|
| Coefficient of variation (CV) | 2.585946684 |
| Kurtosis | 108.5395278 |
| Mean | 0.5994038569 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.918090283 |
| Sum | 59926 |
| Variance | 2.40258189 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 69515 | 69.5% |
| 1 | 18493 | 18.5% |
| 2 | 6250 | 6.3% |
| 3 | 2529 | 2.5% |
| 4 | 1207 | 1.2% |
| 5 | 649 | 0.6% |
| 6 | 398 | 0.4% |
| 7 | 225 | 0.2% |
| 8 | 154 | 0.2% |
| 9 | 111 | 0.1% |
| Other values (27) | 445 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 69515 | 69.5% |
| 1 | 18493 | 18.5% |
| 2 | 6250 | 6.3% |
| 3 | 2529 | 2.5% |
| 4 | 1207 | 1.2% |
| 5 | 649 | 0.6% |
| 6 | 398 | 0.4% |
| 7 | 225 | 0.2% |
| 8 | 154 | 0.2% |
| 9 | 111 | 0.1% |
| Value | Count | Frequency (%) |
| 47 | 1 | < 0.1% |
| 38 | 2 | < 0.1% |
| 37 | 3 | < 0.1% |
| 35 | 2 | < 0.1% |
| 33 | 4 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 2 | < 0.1% |
| 29 | 3 | < 0.1% |
| 28 | 7 | < 0.1% |
| 27 | 7 | < 0.1% |
num_arch_dc_0_12m
Real number (ℝ≥0)
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06174481876 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 95724 |
| Zeros (%) | 95.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3746913273 |
|---|---|
| Coefficient of variation (CV) | 6.068384924 |
| Kurtosis | 265.3512069 |
| Mean | 0.06174481876 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.27561745 |
| Sum | 6173 |
| Variance | 0.1403935907 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 95724 | 95.7% |
| 1 | 3192 | 3.2% |
| 2 | 681 | 0.7% |
| 3 | 196 | 0.2% |
| 4 | 77 | 0.1% |
| 6 | 35 | < 0.1% |
| 5 | 30 | < 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 13 | 4 | < 0.1% |
| Other values (5) | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 95724 | 95.7% |
| 1 | 3192 | 3.2% |
| 2 | 681 | 0.7% |
| 3 | 196 | 0.2% |
| 4 | 77 | 0.1% |
| 5 | 30 | < 0.1% |
| 6 | 35 | < 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 17 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 13 | 4 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 21 | < 0.1% |
| 6 | 35 | < 0.1% |
| 5 | 30 | < 0.1% |
num_arch_dc_12_24m
Real number (ℝ≥0)
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05936424742 |
| Minimum | 0 |
|---|---|
| Maximum | 13 |
| Zeros | 95991 |
| Zeros (%) | 96.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3662243329 |
|---|---|
| Coefficient of variation (CV) | 6.169105965 |
| Kurtosis | 184.2389426 |
| Mean | 0.05936424742 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.91096016 |
| Sum | 5935 |
| Variance | 0.134120262 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 95991 | 96.0% |
| 1 | 2913 | 2.9% |
| 2 | 665 | 0.7% |
| 3 | 195 | 0.2% |
| 4 | 106 | 0.1% |
| 5 | 46 | < 0.1% |
| 7 | 24 | < 0.1% |
| 6 | 17 | < 0.1% |
| 10 | 6 | < 0.1% |
| 8 | 6 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 95991 | 96.0% |
| 1 | 2913 | 2.9% |
| 2 | 665 | 0.7% |
| 3 | 195 | 0.2% |
| 4 | 106 | 0.1% |
| 5 | 46 | < 0.1% |
| 6 | 17 | < 0.1% |
| 7 | 24 | < 0.1% |
| 8 | 6 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 13 | 1 | < 0.1% |
| 11 | 4 | < 0.1% |
| 10 | 6 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 6 | < 0.1% |
| 7 | 24 | < 0.1% |
| 6 | 17 | < 0.1% |
| 5 | 46 | < 0.1% |
| 4 | 106 | 0.1% |
| 3 | 195 | 0.2% |
num_arch_ok_0_12m
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 201 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.275826198 |
| Minimum | 0 |
|---|---|
| Maximum | 261 |
| Zeros | 27406 |
| Zeros (%) | 27.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 7 |
| 95-th percentile | 30 |
| Maximum | 261 |
| Range | 261 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 16.03036935 |
|---|---|
| Coefficient of variation (CV) | 2.203236981 |
| Kurtosis | 46.01920749 |
| Mean | 7.275826198 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.722424978 |
| Sum | 727408 |
| Variance | 256.9727414 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 27406 | 27.4% |
| 1 | 14098 | 14.1% |
| 2 | 9929 | 9.9% |
| 3 | 7370 | 7.4% |
| 4 | 5808 | 5.8% |
| 5 | 4717 | 4.7% |
| 6 | 3822 | 3.8% |
| 7 | 3130 | 3.1% |
| 8 | 2497 | 2.5% |
| 9 | 2126 | 2.1% |
| Other values (191) | 19073 | 19.1% |
| Value | Count | Frequency (%) |
| 0 | 27406 | 27.4% |
| 1 | 14098 | 14.1% |
| 2 | 9929 | 9.9% |
| 3 | 7370 | 7.4% |
| 4 | 5808 | 5.8% |
| 5 | 4717 | 4.7% |
| 6 | 3822 | 3.8% |
| 7 | 3130 | 3.1% |
| 8 | 2497 | 2.5% |
| 9 | 2126 | 2.1% |
| Value | Count | Frequency (%) |
| 261 | 1 | < 0.1% |
| 248 | 1 | < 0.1% |
| 247 | 1 | < 0.1% |
| 236 | 1 | < 0.1% |
| 232 | 1 | < 0.1% |
| 231 | 1 | < 0.1% |
| 225 | 5 | < 0.1% |
| 224 | 3 | < 0.1% |
| 223 | 5 | < 0.1% |
| 222 | 7 | < 0.1% |
num_arch_ok_12_24m
Real number (ℝ≥0)
HIGH CORRELATION (×4), ZEROS
| Distinct | 200 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.369798752 |
| Minimum | 0 |
|---|---|
| Maximum | 313 |
| Zeros | 37905 |
| Zeros (%) | 37.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 6 |
| 95-th percentile | 28 |
| Maximum | 313 |
| Range | 313 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 15.35024427 |
|---|---|
| Coefficient of variation (CV) | 2.409847606 |
| Kurtosis | 82.59847026 |
| Mean | 6.369798752 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 7.153601334 |
| Sum | 636827 |
| Variance | 235.6299992 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37905 | 37.9% |
| 1 | 11038 | 11.0% |
| 2 | 8305 | 8.3% |
| 3 | 6255 | 6.3% |
| 4 | 4824 | 4.8% |
| 5 | 4138 | 4.1% |
| 6 | 3415 | 3.4% |
| 7 | 2761 | 2.8% |
| 8 | 2369 | 2.4% |
| 9 | 1906 | 1.9% |
| Other values (190) | 17060 | 17.1% |
| Value | Count | Frequency (%) |
| 0 | 37905 | 37.9% |
| 1 | 11038 | 11.0% |
| 2 | 8305 | 8.3% |
| 3 | 6255 | 6.3% |
| 4 | 4824 | 4.8% |
| 5 | 4138 | 4.1% |
| 6 | 3415 | 3.4% |
| 7 | 2761 | 2.8% |
| 8 | 2369 | 2.4% |
| 9 | 1906 | 1.9% |
| Value | Count | Frequency (%) |
| 313 | 1 | < 0.1% |
| 304 | 1 | < 0.1% |
| 302 | 2 | < 0.1% |
| 301 | 1 | < 0.1% |
| 293 | 6 | < 0.1% |
| 292 | 2 | < 0.1% |
| 290 | 2 | < 0.1% |
| 288 | 1 | < 0.1% |
| 278 | 1 | < 0.1% |
| 277 | 2 | < 0.1% |
num_arch_rem_0_12m
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4694426662 |
| Minimum | 0 |
|---|---|
| Maximum | 42 |
| Zeros | 76709 |
| Zeros (%) | 76.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 42 |
| Range | 42 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.360348893 |
|---|---|
| Coefficient of variation (CV) | 2.897795601 |
| Kurtosis | 129.526541 |
| Mean | 0.4694426662 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.358018709 |
| Sum | 46933 |
| Variance | 1.850549111 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 76709 | 76.7% |
| 1 | 13521 | 13.5% |
| 2 | 4758 | 4.8% |
| 3 | 2270 | 2.3% |
| 4 | 1092 | 1.1% |
| 5 | 602 | 0.6% |
| 6 | 322 | 0.3% |
| 7 | 222 | 0.2% |
| 8 | 115 | 0.1% |
| 9 | 80 | 0.1% |
| Other values (21) | 285 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 76709 | 76.7% |
| 1 | 13521 | 13.5% |
| 2 | 4758 | 4.8% |
| 3 | 2270 | 2.3% |
| 4 | 1092 | 1.1% |
| 5 | 602 | 0.6% |
| 6 | 322 | 0.3% |
| 7 | 222 | 0.2% |
| 8 | 115 | 0.1% |
| 9 | 80 | 0.1% |
| Value | Count | Frequency (%) |
| 42 | 3 | < 0.1% |
| 39 | 2 | < 0.1% |
| 29 | 10 | < 0.1% |
| 27 | 7 | < 0.1% |
| 26 | 5 | < 0.1% |
| 25 | 7 | < 0.1% |
| 24 | 16 | < 0.1% |
| 23 | 6 | < 0.1% |
| 22 | 1 | < 0.1% |
| 21 | 8 | < 0.1% |
num_arch_written_off_0_12m
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18078 |
| Missing (%) | 18.1% |
| Memory size | 781.2 KiB |
| 0.0 | 81888 |
|---|---|
| 1.0 | 10 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 81888 | 81.9% |
| 1.0 | 10 | < 0.1% |
| (Missing) | 18078 | 18.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 81888 | |
| 1.0 | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
num_arch_written_off_12_24m
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18078 |
| Missing (%) | 18.1% |
| Memory size | 781.2 KiB |
| 0.0 | 81887 |
|---|---|
| 1.0 | 9 |
| 2.0 | 2 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 81887 | 81.9% |
| 1.0 | 9 | < 0.1% |
| 2.0 | 2 | < 0.1% |
| (Missing) | 18078 | 18.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 81887 | |
| 1.0 | 9 | < 0.1% |
| 2.0 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
num_unpaid_bills
Real number (ℝ≥0)
| Distinct | 132 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.141563975 |
| Minimum | 0 |
|---|---|
| Maximum | 182 |
| Zeros | 52000 |
| Zeros (%) | 52.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 10 |
| Maximum | 182 |
| Range | 182 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 6.300977704 |
|---|---|
| Coefficient of variation (CV) | 2.942231835 |
| Kurtosis | 173.107918 |
| Mean | 2.141563975 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.40120692 |
| Sum | 214105 |
| Variance | 39.70232003 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 52000 | 52.0% |
| 1 | 19300 | 19.3% |
| 2 | 9216 | 9.2% |
| 3 | 5007 | 5.0% |
| 4 | 3034 | 3.0% |
| 5 | 2016 | 2.0% |
| 6 | 1519 | 1.5% |
| 7 | 1157 | 1.2% |
| 8 | 891 | 0.9% |
| 9 | 756 | 0.8% |
| Other values (122) | 5080 | 5.1% |
| Value | Count | Frequency (%) |
| 0 | 52000 | 52.0% |
| 1 | 19300 | 19.3% |
| 2 | 9216 | 9.2% |
| 3 | 5007 | 5.0% |
| 4 | 3034 | 3.0% |
| 5 | 2016 | 2.0% |
| 6 | 1519 | 1.5% |
| 7 | 1157 | 1.2% |
| 8 | 891 | 0.9% |
| 9 | 756 | 0.8% |
| Value | Count | Frequency (%) |
| 182 | 1 | < 0.1% |
| 162 | 1 | < 0.1% |
| 160 | 2 | < 0.1% |
| 159 | 1 | < 0.1% |
| 158 | 1 | < 0.1% |
| 153 | 3 | < 0.1% |
| 152 | 1 | < 0.1% |
| 150 | 1 | < 0.1% |
| 149 | 1 | < 0.1% |
| 147 | 2 | < 0.1% |
recovery_debt
Real number (ℝ≥0)
| Distinct | 111 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.035428503 |
| Minimum | 0 |
|---|---|
| Maximum | 36479 |
| Zeros | 99754 |
| Zeros (%) | 99.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 36479 |
| Range | 36479 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 163.934564 |
|---|---|
| Coefficient of variation (CV) | 40.62383062 |
| Kurtosis | 26142.93973 |
| Mean | 4.035428503 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 133.689137 |
| Sum | 403446 |
| Variance | 26874.54126 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 99754 | 99.8% |
| 500 | 47 | < 0.1% |
| 1000 | 24 | < 0.1% |
| 1500 | 7 | < 0.1% |
| 2190 | 6 | < 0.1% |
| 601 | 4 | < 0.1% |
| 1275 | 4 | < 0.1% |
| 1939 | 3 | < 0.1% |
| 2080 | 3 | < 0.1% |
| 2580 | 3 | < 0.1% |
| Other values (101) | 121 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 99754 | 99.8% |
| 47 | 1 | < 0.1% |
| 90 | 1 | < 0.1% |
| 99 | 1 | < 0.1% |
| 348 | 2 | < 0.1% |
| 400 | 1 | < 0.1% |
| 500 | 47 | < 0.1% |
| 519 | 1 | < 0.1% |
| 521 | 1 | < 0.1% |
| 530 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 36479 | 1 | < 0.1% |
| 16411 | 1 | < 0.1% |
| 11190 | 1 | < 0.1% |
| 10230 | 1 | < 0.1% |
| 7910 | 1 | < 0.1% |
| 7200 | 2 | < 0.1% |
| 6630 | 1 | < 0.1% |
| 6285 | 1 | < 0.1% |
| 6065 | 1 | < 0.1% |
| 5590 | 1 | < 0.1% |
sum_capital_paid_account_0_12m
Real number (ℝ≥0)
HIGH CORRELATION (×4), ZEROS
| Distinct | 22580 |
|---|---|
| Distinct (%) | 22.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10816.06539 |
| Minimum | 0 |
|---|---|
| Maximum | 571475 |
| Zeros | 66011 |
| Zeros (%) | 66.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 9029.75 |
| 95-th percentile | 57993.75 |
| Maximum | 571475 |
| Range | 571475 |
| Interquartile range (IQR) | 9029.75 |
Descriptive statistics
| Standard deviation | 26463.97217 |
|---|---|
| Coefficient of variation (CV) | 2.446728198 |
| Kurtosis | 40.04623822 |
| Mean | 10816.06539 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.920542743 |
| Sum | 1081346953 |
| Variance | 700341823 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 66011 | 66.0% |
| 300 | 40 | < 0.1% |
| 2990 | 37 | < 0.1% |
| 700 | 27 | < 0.1% |
| 31067 | 26 | < 0.1% |
| 3385 | 23 | < 0.1% |
| 5290 | 21 | < 0.1% |
| 2641 | 20 | < 0.1% |
| 30 | 18 | < 0.1% |
| 100 | 18 | < 0.1% |
| Other values (22570) | 33735 | 33.7% |
| Value | Count | Frequency (%) |
| 0 | 66011 | 66.0% |
| 1 | 9 | < 0.1% |
| 2 | 2 | < 0.1% |
| 3 | 17 | < 0.1% |
| 4 | 14 | < 0.1% |
| 5 | 6 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 571475 | 1 | < 0.1% |
| 509348 | 1 | < 0.1% |
| 490672 | 4 | < 0.1% |
| 452715 | 1 | < 0.1% |
| 451351 | 1 | < 0.1% |
| 447678 | 1 | < 0.1% |
| 418519 | 1 | < 0.1% |
| 413220 | 4 | < 0.1% |
| 392076 | 1 | < 0.1% |
| 391836 | 1 | < 0.1% |
sum_capital_paid_account_12_24m
Real number (ℝ≥0)
HIGH CORRELATION (×4), ZEROS
| Distinct | 16667 |
|---|---|
| Distinct (%) | 16.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6542.895325 |
| Minimum | 0 |
|---|---|
| Maximum | 341859 |
| Zeros | 74788 |
| Zeros (%) | 74.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 85 |
| 95-th percentile | 39200.75 |
| Maximum | 341859 |
| Range | 341859 |
| Interquartile range (IQR) | 85 |
Descriptive statistics
| Standard deviation | 19041.22359 |
|---|---|
| Coefficient of variation (CV) | 2.910213696 |
| Kurtosis | 43.76498797 |
| Mean | 6542.895325 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.410064826 |
| Sum | 654132503 |
| Variance | 362568195.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 74788 | 74.8% |
| 300 | 72 | 0.1% |
| 96974 | 37 | < 0.1% |
| 20390 | 30 | < 0.1% |
| 3485 | 27 | < 0.1% |
| 2190 | 25 | < 0.1% |
| 5990 | 22 | < 0.1% |
| 50 | 21 | < 0.1% |
| 895 | 20 | < 0.1% |
| 3190 | 19 | < 0.1% |
| Other values (16657) | 24915 | 24.9% |
| Value | Count | Frequency (%) |
| 0 | 74788 | 74.8% |
| 1 | 6 | < 0.1% |
| 2 | 5 | < 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 341859 | 1 | < 0.1% |
| 336568 | 2 | < 0.1% |
| 333900 | 1 | < 0.1% |
| 321102 | 1 | < 0.1% |
| 315838 | 1 | < 0.1% |
| 314738 | 1 | < 0.1% |
| 314503 | 1 | < 0.1% |
| 313502 | 2 | < 0.1% |
| 310682 | 1 | < 0.1% |
| 307975 | 1 | < 0.1% |
sum_paid_inv_0_12m
Real number (ℝ≥0)
HIGH CORRELATION (×4), ZEROS
| Distinct | 38387 |
|---|---|
| Distinct (%) | 38.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39208.80222 |
| Minimum | 0 |
|---|---|
| Maximum | 2962870 |
| Zeros | 21692 |
| Zeros (%) | 21.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2600 |
| median | 15995 |
| Q3 | 43844.25 |
| 95-th percentile | 150288.5 |
| Maximum | 2962870 |
| Range | 2962870 |
| Interquartile range (IQR) | 41244.25 |
Descriptive statistics
| Standard deviation | 90649.28528 |
|---|---|
| Coefficient of variation (CV) | 2.311962624 |
| Kurtosis | 411.8589793 |
| Mean | 39208.80222 |
| Median Absolute Deviation (MAD) | 15995 |
| Skewness | 15.52557252 |
| Sum | 3919939211 |
| Variance | 8217292921 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 21692 | 21.7% |
| 895 | 269 | 0.3% |
| 1790 | 153 | 0.2% |
| 2000 | 99 | 0.1% |
| 1000 | 94 | 0.1% |
| 2290 | 90 | 0.1% |
| 1990 | 81 | 0.1% |
| 3290 | 77 | 0.1% |
| 2490 | 75 | 0.1% |
| 1290 | 75 | 0.1% |
| Other values (38377) | 77271 | 77.3% |
| Value | Count | Frequency (%) |
| 0 | 21692 | 21.7% |
| 90 | 1 | < 0.1% |
| 175 | 2 | < 0.1% |
| 210 | 1 | < 0.1% |
| 270 | 2 | < 0.1% |
| 290 | 2 | < 0.1% |
| 295 | 4 | < 0.1% |
| 300 | 3 | < 0.1% |
| 320 | 1 | < 0.1% |
| 360 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2962870 | 1 | < 0.1% |
| 2853992 | 1 | < 0.1% |
| 2835652 | 2 | < 0.1% |
| 2792694 | 1 | < 0.1% |
| 2789204 | 3 | < 0.1% |
| 2768835 | 3 | < 0.1% |
| 2746889 | 3 | < 0.1% |
| 2744362 | 2 | < 0.1% |
| 2725405 | 2 | < 0.1% |
| 2719917 | 1 | < 0.1% |
time_hours
Real number (ℝ≥0)
| Distinct | 50650 |
|---|---|
| Distinct (%) | 50.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.32977989 |
| Minimum | 0.0002777777778 |
|---|---|
| Maximum | 23.99972222 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0.0002777777778 |
|---|---|
| 5-th percentile | 7.364930556 |
| Q1 | 11.62270833 |
| median | 15.79277778 |
| Q3 | 19.54201389 |
| 95-th percentile | 22.33972222 |
| Maximum | 23.99972222 |
| Range | 23.99944444 |
| Interquartile range (IQR) | 7.919305556 |
Descriptive statistics
| Standard deviation | 5.031360239 |
|---|---|
| Coefficient of variation (CV) | 0.3282082505 |
| Kurtosis | -0.2260409869 |
| Mean | 15.32977989 |
| Median Absolute Deviation (MAD) | 3.934444444 |
| Skewness | -0.4981009139 |
| Sum | 1532610.074 |
| Variance | 25.31458585 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19.32361111 | 11 | < 0.1% |
| 19.6 | 9 | < 0.1% |
| 13.21666667 | 9 | < 0.1% |
| 15.47277778 | 9 | < 0.1% |
| 19.80694444 | 9 | < 0.1% |
| 17.08888889 | 8 | < 0.1% |
| 15.82222222 | 8 | < 0.1% |
| 18.17611111 | 8 | < 0.1% |
| 20.27527778 | 8 | < 0.1% |
| 18.09444444 | 8 | < 0.1% |
| Other values (50640) | 99889 | 99.9% |
| Value | Count | Frequency (%) |
| 0.0002777777778 | 1 | < 0.1% |
| 0.001666666667 | 2 | < 0.1% |
| 0.003333333333 | 1 | < 0.1% |
| 0.003611111111 | 1 | < 0.1% |
| 0.004444444444 | 1 | < 0.1% |
| 0.005555555556 | 1 | < 0.1% |
| 0.006388888889 | 1 | < 0.1% |
| 0.007222222222 | 3 | < 0.1% |
| 0.007777777778 | 1 | < 0.1% |
| 0.008055555556 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 23.99972222 | 2 | < 0.1% |
| 23.99861111 | 1 | < 0.1% |
| 23.99833333 | 1 | < 0.1% |
| 23.99666667 | 1 | < 0.1% |
| 23.99638889 | 4 | < 0.1% |
| 23.99583333 | 1 | < 0.1% |
| 23.99444444 | 2 | < 0.1% |
| 23.99305556 | 2 | < 0.1% |
| 23.99277778 | 1 | < 0.1% |
| 23.99194444 | 1 | < 0.1% |
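The extremes of time_hours (0.0002777777778 and 23.99972222) are one second after midnight and one second before it when read as fractional hours. The report does not state this, so treat it as an assumption: the column appears to encode a time of day at one-second resolution. Under that assumption, a small hypothetical helper converts a value to a clock time:

```python
def hours_to_hms(hours):
    """Convert fractional hours (assumed time of day) to an HH:MM:SS string."""
    # Round to whole seconds; the reported values look like multiples of 1/3600.
    total_seconds = round(hours * 3600)
    h, remainder = divmod(total_seconds, 3600)
    m, s = divmod(remainder, 60)
    return f"{h:02d}:{m:02d}:{s:02d}"


# Values taken from the tables above.
smallest = hours_to_hms(0.0002777777778)
largest = hours_to_hms(23.99972222)
common = hours_to_hms(19.6)
print(smallest, largest, common)
```

If the assumption holds, the median of 15.79277778 corresponds to a mid-afternoon time, which may be easier to interpret than the raw figure.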
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
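As an illustration of the rank-based calculation, here is a minimal pure-Python sketch. The function names are hypothetical, and ties are assumed to receive the average of their rank positions, a common convention that the report does not spell out.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson correlation applied to the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]   # monotonic but nonlinear in x
rho = spearman(x, y)
r = pearson(x, y)
print(rho, r)
```

For this monotonic but nonlinear relationship, the rank-based coefficient reaches its maximum while the linear coefficient stays below it.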
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
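The invariance under location and scale can be checked numerically. The sketch below uses made-up data and a hypothetical `pearson_r` helper, and it assumes positive scale factors (a negative factor would flip the sign of r).

```python
from math import sqrt


def pearson_r(x, y):
    """Sample covariance of x and y over the product of sample standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)


# Illustrative data, not taken from the dataset.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 7.0]

r_original = pearson_r(x, y)
# Shift and rescale each variable separately.
r_transformed = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
print(r_original, r_transformed)
```

Both calls should agree up to floating-point rounding, which is why rescaling monetary columns such as sum_paid_inv_0_12m would not change their Pearson correlations.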
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
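The pair-counting definition can be written out directly. This sketch implements the formula exactly as stated, which is the variant usually called τ-a. Library implementations commonly use a tie-adjusted variant (τ-b), so results can differ when the data contain ties. The function name and data are illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of the product says whether x and y order the pair the same way.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y; it counts toward neither group.
    return (concordant - discordant) / len(pairs)


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]          # one swapped pair out of six
tau = kendall_tau_a(x, y)
tau_reversed = kendall_tau_a(x, [4, 3, 2, 1])
print(tau, tau_reversed)
```

The double loop is quadratic in the number of observations, so for a dataset of this size a library routine is the practical choice.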
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | account_amount_added_12_24m | account_days_in_dc_12_24m | account_days_in_rem_12_24m | account_days_in_term_12_24m | account_incoming_debt_vs_paid_0_24m | age | avg_payment_span_0_12m | avg_payment_span_0_3m | max_paid_inv_0_12m | max_paid_inv_0_24m | num_active_div_by_paid_inv_0_12m | num_active_inv | num_arch_dc_0_12m | num_arch_dc_12_24m | num_arch_ok_0_12m | num_arch_ok_12_24m | num_arch_rem_0_12m | num_arch_written_off_0_12m | num_arch_written_off_12_24m | num_unpaid_bills | recovery_debt | sum_capital_paid_account_0_12m | sum_capital_paid_account_12_24m | sum_paid_inv_0_12m | time_hours |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0.0 | 0.0 | 0.0 | 0.000000 | 20 | 12.692308 | 8.333333 | 31638.0 | 31638.0 | 0.153846 | 2 | 0 | 0 | 13 | 14 | 0 | 0.0 | 0.0 | 2 | 0 | 0 | 0 | 178839 | 9.653333 |
| 1 | 0 | 0.0 | 0.0 | 0.0 | NaN | 50 | 25.833333 | 25.000000 | 13749.0 | 13749.0 | 0.000000 | 0 | 0 | 0 | 9 | 19 | 3 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 49014 | 13.181389 |
| 2 | 0 | 0.0 | 0.0 | 0.0 | NaN | 22 | 20.000000 | 18.000000 | 29890.0 | 29890.0 | 0.071429 | 1 | 0 | 0 | 11 | 0 | 3 | 0.0 | 0.0 | 1 | 0 | 0 | 0 | 124839 | 11.561944 |
| 3 | 0 | NaN | NaN | NaN | NaN | 36 | 4.687500 | 4.888889 | 40040.0 | 40040.0 | 0.031250 | 1 | 0 | 0 | 31 | 21 | 0 | 0.0 | 0.0 | 1 | 0 | 0 | 0 | 324676 | 15.751111 |
| 4 | 0 | 0.0 | 0.0 | 0.0 | NaN | 25 | 13.000000 | 13.000000 | 7100.0 | 7100.0 | 0.000000 | 0 | 0 | 0 | 1 | 0 | 0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 7100 | 12.698611 |
| 5 | 0 | 0.0 | 0.0 | 0.0 | NaN | 18 | NaN | NaN | 0.0 | 0.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 18.328333 |
| 6 | 0 | 0.0 | 142.0 | 0.0 | 0.000000 | 49 | 3.000000 | 3.000000 | 2373.0 | 2373.0 | 0.000000 | 0 | 0 | 0 | 1 | 0 | 0 | 0.0 | 0.0 | 0 | 0 | 18760 | 8337 | 2373 | 10.244444 |
| 7 | 57229 | 0.0 | 0.0 | 0.0 | 0.232244 | 34 | 26.930233 | 25.866667 | 8655.0 | 9645.0 | 0.083333 | 20 | 0 | 0 | 215 | 257 | 0 | 0.0 | 0.0 | 37 | 0 | 42206 | 35336 | 457257 | 12.192778 |
| 8 | 148922 | 0.0 | 47.0 | 0.0 | 0.969055 | 40 | 33.727273 | 37.571429 | 6075.0 | 9090.0 | 0.818182 | 9 | 0 | 0 | 3 | 2 | 3 | 0.0 | 0.0 | 23 | 0 | 104643 | 32381 | 24390 | 21.411111 |
| 9 | 0 | 0.0 | 0.0 | 0.0 | NaN | 47 | 21.000000 | 21.250000 | 36985.0 | 36985.0 | 0.000000 | 0 | 0 | 0 | 5 | 10 | 0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 78620 | 13.340833 |
Last rows
| | account_amount_added_12_24m | account_days_in_dc_12_24m | account_days_in_rem_12_24m | account_days_in_term_12_24m | account_incoming_debt_vs_paid_0_24m | age | avg_payment_span_0_12m | avg_payment_span_0_3m | max_paid_inv_0_12m | max_paid_inv_0_24m | num_active_div_by_paid_inv_0_12m | num_active_inv | num_arch_dc_0_12m | num_arch_dc_12_24m | num_arch_ok_0_12m | num_arch_ok_12_24m | num_arch_rem_0_12m | num_arch_written_off_0_12m | num_arch_written_off_12_24m | num_unpaid_bills | recovery_debt | sum_capital_paid_account_0_12m | sum_capital_paid_account_12_24m | sum_paid_inv_0_12m | time_hours |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99966 | 16642 | 0.0 | 59.0 | 0.0 | 0.000000 | 40 | 40.000000 | 16.000000 | 11835.0 | 11835.0 | 0.142857 | 1 | 0 | 0 | 2 | 5 | 5 | 0.0 | 0.0 | 2 | 0 | 12738 | 10852 | 44244 | 17.471389 |
| 99967 | 32355 | 0.0 | 125.0 | 44.0 | 0.665829 | 28 | 12.000000 | NaN | 9330.0 | 9330.0 | 0.000000 | 0 | 0 | 0 | 1 | 0 | 0 | 0.0 | 0.0 | 16 | 0 | 20633 | 22355 | 10225 | 13.057500 |
| 99968 | 0 | 0.0 | 0.0 | 0.0 | NaN | 45 | 17.090909 | 20.714286 | 12264.0 | 12264.0 | 0.037975 | 3 | 0 | 0 | 77 | 44 | 0 | 0.0 | 0.0 | 3 | 0 | 0 | 0 | 276135 | 19.786944 |
| 99969 | 0 | 365.0 | 0.0 | 0.0 | 0.371604 | 31 | NaN | NaN | 895.0 | 895.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | 2 | 0 | 7695 | 1025 | 895 | 11.290833 |
| 99970 | 88405 | 0.0 | 15.0 | 0.0 | 0.672000 | 21 | NaN | NaN | 0.0 | 6242.0 | NaN | 0 | 0 | 0 | 0 | 2 | 0 | 0.0 | 0.0 | 9 | 0 | 28870 | 25771 | 0 | 9.060833 |
| 99971 | 0 | 0.0 | 0.0 | 0.0 | NaN | 33 | 10.333333 | NaN | 35195.0 | 35195.0 | 0.000000 | 0 | 0 | 0 | 6 | 2 | 0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 60127 | 10.765556 |
| 99972 | 0 | 0.0 | 0.0 | 0.0 | 0.004044 | 44 | 36.000000 | NaN | 4740.0 | 4740.0 | 0.000000 | 0 | 0 | 0 | 1 | 3 | 0 | 0.0 | 0.0 | 1 | 0 | 7948 | 0 | 4740 | 21.708333 |
| 99973 | 45671 | 0.0 | 20.0 | 0.0 | 0.705078 | 24 | NaN | NaN | 1200.0 | 1200.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | 18 | 0 | 17447 | 19627 | 3100 | 2.185278 |
| 99974 | 56102 | 0.0 | 0.0 | 0.0 | 0.064175 | 31 | 17.500000 | NaN | 15000.0 | 15000.0 | 0.000000 | 0 | 0 | 0 | 2 | 1 | 0 | 0.0 | 0.0 | 1 | 0 | 18339 | 56180 | 34785 | 9.725278 |
| 99975 | 0 | 0.0 | 0.0 | 0.0 | NaN | 41 | 34.666667 | 37.500000 | 13246.0 | 14817.0 | 0.000000 | 0 | 0 | 0 | 2 | 2 | 1 | 0.0 | 0.0 | 1 | 0 | 0 | 0 | 30602 | 11.585278 |